Effects of Corpus Choice on Statistical Articulatory Modeling

نویسندگان

  • Olov Engwall
  • Jonas Beskow
چکیده

ABSTRACT: Statistical articulatory modeling is based on the analysis of a collected speech corpus to extract parameters that should be able to resynthesize the articulatory movements in the original corpus. As the parameters are based directly on the corpus, the size and content of the corpus have important effects on the model. In this paper we compare statistical articulatory models based on three corpora of different sorts: one rather exhaustive consisting of sentences, one limitied, but articulatory balanced and one limited and more unbalanced. The articulatory data was collected in a simultaneous recording with the electromagnetic articulograph Movetrack and the infrared stereo capture system Qualisys. The subject was a female native speaker of Swedish. The comparison of the different corpora focused on the articulatory space covered, the outcome of the statistical component analysis, articulatory speed and hyperarticulation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards unsupervised articulatory resynthesis of German utterances using EMA data

As part of ongoing research towards integrating an articulatory synthesizer into a text-to-speech (TTS) framework, a corpus of German utterances recorded with electromagnetic articulography (EMA) is resynthesized to provide training data for statistical models. The resynthesis is based on a measure of similarity between the original and resynthesized EMA trajectories, weighted by articulatory r...

متن کامل

Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...

متن کامل

Articulatory synthesis using corpus-based estimation of line spectrum pairs

An attempt to define a new articulatory synthesis method, in which the speech signal is generated through a statistical estimation of its relation with articulatory parameters, is presented. A corpus containing acoustic material and simultaneous recordings of the tongue and facial movements was used to train and test the articulatory synthesis of VCV words and short sentences. Tongue and facial...

متن کامل

Articulatory synthesis using corpus-based e

An attempt to define a new articulatory synthesis method, in which the speech signal is generated through a statistical estimation of its relation with articulatory parameters, is presented. A corpus containing acoustic material and simultaneous recordings of the tongue and facial movements was used to train and test the articulatory synthesis of VCV words and short sentences. Tongue and facial...

متن کامل

Can tongue be recovered from face? the answer of data-driven statistical models

This study revisits the face-to-tongue articulatory inversion problem in speech. We compare the Multi Linear Regression method (MLR) with two more sophisticated methods based on Hidden Markov Models (HMMs) and Gaussian Mixture Models (GMMs), using the same French corpus of articulatory data acquired by ElectroMagnetoGraphy. GMMs give overall results better than HMMs, but MLR does poorly. GMMs a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003